An Initial Quality Analysis of the Ohloh Software Evolution Data
نویسنده
چکیده
Large public data sets on software evolution promise great value to both researchers and practitioners, in particular for software (development) analytics. To realise this value, the data quality of such data sets needs to be studied and improved. Despite these data sets being of a secondary nature, i.e., they were not collected by the people using them, data quality is often taken for granted, casting doubt on conclusions drawn from those data. This paper reports on an intial investigation of the quality of the software evolution data available on Ohloh, and further describes steps taken to cleanse the data set. Our goal is that other researchers, practitioners, and parties responsible for data sets such as Ohloh, use the outcomes of the validation and cleansing steps to improve quality of data sets in the public domain.
منابع مشابه
Discovering Determinants of Project Participation in an Open Source Social Network
Successful open source software projects often require a steady supply of self motivated software developers. However, little work has been done from a relational/network perspective to study the factors that drive the developers to participate in OSS projects. In this paper, we investigate the participation dynamics in a social network, particularly in an online open source community called Oh...
متن کاملTowards Base Rates in Software Analytics
Nowadays a vast and growing body of open source software (OSS) project data is publicly available on the internet. Despite this public body of project data, the field of software analytics has not yet settled on a solid quantitative base for basic properties such as code size, growth, team size, activity, and project failure. What is missing is a quantification of the base rates of such propert...
متن کاملExistence of Mild Solutions to a Cauchy Problem Presented by Fractional Evolution Equation with an Integral Initial Condition
In this article, we apply two new fixed point theorems to investigate the existence of mild solutions for a nonlocal fractional Cauchy problem with an integral initial condition in Banach spaces.
متن کاملInvestigating Relationships Between FLOSS Foundations and FLOSS Projects
Foundations function as vital institutional support infrastructures for many of the most successful open source projects, but the role of these support entities remains an understudied phenomenon in FLOSS research. Drawing on Open Hub (formerly known as Ohloh) data, this paper empirically investigates the different ways these entities support projects and interact with different projects and wi...
متن کاملEffective Strategies for Optimal Implementation of Evolution and Innovation Packages in Medical Education
ABSTRACT BACKGROUND AND OBJECTIVE: Evolution and innovation packages in medical science education are the main program of medical education and it is necessary to pay attention to the provision of infrastructure of their implementation. This study was conducted to identify effective strategies for optimal implementation of evolution and innovation packages in medical education. METHODS: The met...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- ECEASST
دوره 65 شماره
صفحات -
تاریخ انتشار 2014